Design and Realization of the EXCITEMENT Open Platform for Textual Entailment

نویسندگان

  • Günter Neumann
  • Sebastian Padó
چکیده

Textual Entailment (TE) is a binary relation between two natural language text which holds if the truth of a first text implies the truth of the second one, or at least makes it very likely. Good methods to recognize TE have the potential to impact many NLP tasks, where the ability to draw conclusions from textual expressed facts is a key challenge. The area of TE has seen the development of a range of algorithms, methods, and technologies over the last decade. Unfortunately, research on TE (like semantics research more generally), is fragmented into studies focussing on various aspects of semantics such as world knowledge, lexical and syntactic relations, and so on. This fragmentation has problematic practical consequences. Notably, interoperability among existing RTE systems is poor, and reuse of resources and algorithms is mostly infeasible. This also makes systematic evaluations very difficult to carry out. Finally, TE presents a wide array of approaches to potential end users with little guidance on which to pick. Our contribution to this situation is a novel architecture and platform, the EXCITEMENT Open Platform (EOP), which was developed to enable and encourage the consolidation of methods and resources in the TE area. Starting out from and generalizing over three existing systems (BIUTEE, EDITS, and TIE), our architecture decomposes RTE into components with strongly typed interfaces. The specifications cover (a) a modular linguistic analysis pipeline and (b) a decomposition of the ”core” RTE methods into top-level algorithms and subcomponents. We identify four major subcomponent types, including different kinds of knowledge bases. The architecture was developed with a focus on generality, supporting all major approaches to RTE, as well as encouraging language independence. The practical implementation of this architecture forms the EXCITEMENT open platform (EOP). It is a suite of textual entailment algorithms and components which contains the three systems named above, including linguistic-analysis pipelines for three languages (English, German, and Italian), and comprises a number of linguistic resources. By addressing the problems outlined above, the platform provides a comprehensive and flexible basis for research and experimentation in Textual Entailment. We discuss the current scope and functionality of the platform, which is available as free open source software, and outline existing and future use cases.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using UIMA to Structure An Open Platform for Textual Entailment

EXCITEMENT is a novel, open software platform for Textual Entailment (TE) which uses the UIMA framework. This paper discusses the design considerations regarding the roles of UIMA within EXCITEMENT Open Platform (EOP). We focus on two points: a) how to best design the representation of entailment problems within UIMA CAS and its type system. b) the integration and usage of UIMA components among...

متن کامل

EXploring Customer Interactions through Textual EntailMENT

Description EXCITEMENT is a 3-year research project (1/2012-12/2014) funded by the European Commission under FP7. The project consortium includes NICE Systems LTD (Israel) as coordinator, four academic partners, University of Bar Ilan (Israel), DFKI (Germany), FBK (Italy), University of Heidelberg (Germany), and two industrial partners, Almawave S.R.L. (Italy) and OMQ GmbH (Germany). The main t...

متن کامل

Passing a USA National Bar Exam: a First Corpus for Experimentation

Bar exams provide a key watershed by which legal professionals demonstrate their knowledge of the law and its application. Passing the bar entitles one to practice the law in a given jurisdiction. The bar provides an excellent benchmark for the performance of legal information systems since passing the bar would arguably signal that the system has acquired key aspects of legal reason on a par w...

متن کامل

Entailment graphs for text exploration

Taxonomy-based representations are widely used to model compactly large amounts of textual data. While current methods allow organizing knowledge at the lexical level (keywords/concepts/topics), there is an increasing demand to move towards more informative representations, which express properties of concepts and relations among them. This demand triggered our research on statement entailment ...

متن کامل

The Excitement Open Platform for Textual Inferences

This paper presents the Excitement Open Platform (EOP), a generic architecture and a comprehensive implementation for textual inference in multiple languages. The platform includes state-of-art algorithms, a large number of knowledge resources, and facilities for experimenting and testing innovative approaches. The EOP is distributed as an open source software.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013